首页> 外文OA文献 >Time delay estimation of reverberant meeting speech: on the use of multichannel linear prediction
【2h】

Time delay estimation of reverberant meeting speech: on the use of multichannel linear prediction

机译:混响会议语音的时延估计:关于多通道线性预测的使用

代理获取
本网站仅为用户提供外文OA文献查询和代理获取服务,本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文,但由于OA文献来源多样且变更频繁,仍可能出现获取不到、文献不完整或与标题不符等情况,如果获取不到我们将提供退款服务。请知悉。

摘要

Effective and efficient access to multiparty meeting recordings requires techniques for meeting analysis and indexing. Since meeting participants are generally stationary, speaker location information may be used to identify meeting events e.g., detect speaker changes. Time-delay estimation (TDE) utilizing cross-correlation of multichannel speech recordings is a common approach for deriving speech source location information. Research improved TDE by calculating TDE from linear prediction (LP) residual signals obtained from LP analysis on each individual speech channel. This paper investigates the use of LP residuals for speech TDE, where the residuals are obtained from jointly modeling the multiple speech channels. Experiments conducted with a simulated reverberant room and real room recordings show that jointly modeled LP better predicts the LP coefficients, compared to LP applied to individual channels. Both the individually and jointly modeled LP exhibit similar TDE performance, and outperform TDE on the speech alone, especially with the real recordings.
机译:有效且高效地访问多方会议记录需要使用会议分析和索引编制技术。由于会议参与者通常是固定的,因此发言者位置信息可以用于识别会议事件,例如检测发言者变化。利用多通道语音记录的互相关性的时延估计(TDE)是推导语音源位置信息的常用方法。通过从每个语音通道上的LP分析获得的线性预测(LP)残差信号计算TDE,可以研究改进的TDE。本文研究了将LP残差用于语音TDE的情况,其中残差是通过对多个语音通道进行联合建模而获得的。用模拟混响室和真实室录音进行的实验表明,与应用于单个通道的LP相比,联合建模的LP可以更好地预测LP系数。单独和联合建模的LP都表现出相似的TDE性能,并且仅在语音方面表现优于TDE,尤其是在真实录音方面。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
代理获取

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号